WHAT IS CLAIMED IS:

1. A method for extracting commands and acoustic
data in a same utterance, comprising the steps of: 

decoding at least one word in acoustic data
representing an acoustic signal that comprises a human
utterance and determining acoustic word boundaries within
the acoustic data;

extracting at least one command in a decoded
utterance; and

identifying acoustic data segments in the utterance
based on the acoustic word boundaries.

2. The method as recited in claim 1, wherein the
step of determining acoustic word boundaries includes 
finding segment boundaries by iteratively comparing the 
same utterance to a plurality of vocabularies. 

3. The method as recited in claim 1, further 
comprising the step of executing the at least one command
from the decoded utterance.

4. The method as recited in claim 3, further
comprising at least one of storing the acoustic data 
segments and using the acoustic data segments in executing 
the at least one command. 

5. The method as recited in claim 3, further 
comprising the step of submitting at least one non-command
voice data segment for recognition using the recognizer
vocabulary.

6. The method as recited in claim 1, further
comprising the step of changing a recognizer vocabulary. 

7. The method as recited in claim 1, further 
comprising the step of submitting the acoustic data 
segments for recognition when computing resources are 
available. 

8. The method as recited in claim 1, wherein the 
step of extracting at least one command from the utterance
includes employing one or more grammars to distinguish the
command.

9. The method as recited in claim 8, wherein the 
grammars include a form for extracting information for an 
order or verbal contract. 

10. The method as recited in claim 8, wherein the 
grammars include a form for reminding a user to perform a
task.

11. The method as recited in claim 8, wherein the 
grammars include a form for extracting maximum meaningful 
length segments under interruption or silence conditions. 

12. The method as recited in claim 8, wherein the step 
of using grammars includes the step of associating at least 
one grammar label with the corresponding segment of 
acoustic data that has been decoded into a command.

13. The method as recited in claim 12, wherein the
label includes a numerical value associated with each
command.

14. The method as recited in claim 1, further 
comprising the step of executing the at least one command in
the utterance using undecoded acoustic data from within the 
same utterance. 

15. A program storage device readable by machine, 
tangibly embodying a program of instructions executable by
the machine to perform method steps for processing commands 
and voice data in a same utterance as recited in claim 1. 

16. A method for recognizing at least one command and 
at least one segment of acoustic voice data in a same 
utterance comprising the steps of: 

decoding at least one word in voice data representing 
the acoustic signal that comprises a human utterance and 
determining the acoustic word boundaries within the voice
data;

extracting at least one command from the utterance;

associating segments in the voice data based on the
acoustic word boundaries with labels.

17. The method as recited in claim 16, wherein the
step of extracting includes employing an application, which
identifies commands in the utterance in accordance with the
labels.

18. The method as recited in claim 16, further 
comprising the step of executing the at least one command
utilizing undecoded information in the acoustic voice data. 

19. The method as recited in claim 16, wherein the 
step of extracting includes the step of storing at least 
one non-command voice data segment. 

20. The method as recited in claim 16, wherein the 
step of extracting includes calling a vocabulary for
recognizing numbers and recognizing the numbers in the
utterance.

21. The method as recited in claim 16, wherein the 
step of extracting includes extracting acoustic data based 
on word boundaries and saving the acoustic data for 
acoustically rendering the acoustic data. 

22. The method as recited in claim 16, wherein the
step of extracting includes extracting acoustic data based
on word boundaries and decoding the acoustic data for
storage.

23. The method as recited in claim 16, wherein the 
step of associating includes the step of changing a 
recognizer vocabulary and submitting at least one non- 
command voice data segment for recognition. 

24. The method as recited in claim 16, further 
comprising the step of buffering the utterance to be 
processed and maintaining the utterance in memory during
processing of the utterance. 

25. The method as recited in claim 16, wherein the 
step of associating time segments of the word boundaries of 
the commands with a label includes employing grammars to
associate a unique label with each command segment in the
utterance.

26. The method as recited in claim 25, wherein the
label includes a numerical value. 

27. The method as recited in claim 25, wherein the 
grammars include a form for extracting information for an
order or verbal contract.

28. The method as recited in claim 25, wherein the 
grammars include a form for reminding a user to perform a
task.

29. The method as recited in claim 25, wherein the
grammars include a form for extracting maximum meaningful 
length segments under interruption or silence conditions. 

30. The method as recited in claim 16, wherein the 
step of determining the acoustic word boundaries includes 
finding segment boundaries by iteratively comparing the same 
utterance to a plurality of vocabularies. 

31. A program storage device readable by machine, 
tangibly embodying a program of instructions executable by 
the machine to perform method steps for recognizing 
commands and voice data in a same utterance as recited in 
claim 16.

32. A system for recognizing commands and voice data
in a same utterance comprising: 

an acoustic input, which receives utterances; 

a data buffer, which stores audio data representing
the utterances;

a speech recognition engine, which matches portions of 
the utterances to acoustic models and language models to
recognize words and word boundaries in the utterance and
labels commands in the utterance;

at least one program that executes label-identified
commands and processes remaining portions of the utterance
in accordance with the commands.

33. The system as recited in claim 32, wherein the at 
least one program includes a function which searches the 
utterance for labels output from the speech recognition 
engine to execute a command associated with the label. 

34. The system as recited in claim 32, wherein, in 
accordance with each label, an audio segment is identified
and processed.

35. The system as recited in claim 32, wherein the 
speech recognition engine utilizes grammars with labels, 
which the system uses for assigning labels to decoded
commands.

36. The system as recited in claim 35, wherein the
grammars are represented in Backus-Naur Form (BNF).
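
For illustration only, the separation of labeled commands from non-command acoustic segments recited in claims 1, 13 and 16 might be sketched as follows. This is a minimal sketch, not the claimed implementation: the DecodedWord type, the COMMAND_LABELS table, the example phrases, and the split_utterance function are all hypothetical names introduced here, and the sketch assumes a recognizer has already produced decoded words with start and end times.

```python
from dataclasses import dataclass


@dataclass
class DecodedWord:
    """One decoded word with its acoustic word boundaries (hypothetical type)."""
    text: str
    start: float  # seconds from the start of the utterance
    end: float


# Hypothetical stand-in for a labeled grammar: command phrase -> numerical label.
COMMAND_LABELS = {("send", "memo"): 1, ("end", "memo"): 2}


def split_utterance(words, command_labels=COMMAND_LABELS):
    """Return (commands, segments) for one decoded utterance.

    commands: list of (label, start, end) for each matched command phrase.
    segments: list of (start, end) spans of non-command audio, delimited
              by the word boundaries of the surrounding words.
    """
    commands, segments = [], []
    seg_start = seg_end = None
    i = 0
    while i < len(words):
        match = None
        for phrase, label in command_labels.items():
            candidate = tuple(w.text.lower() for w in words[i:i + len(phrase)])
            if candidate == phrase:
                match = (label, len(phrase))
                break
        if match:
            # Close any open non-command segment before recording the command.
            if seg_start is not None:
                segments.append((seg_start, seg_end))
                seg_start = None
            label, n = match
            commands.append((label, words[i].start, words[i + n - 1].end))
            i += n
        else:
            # Extend (or open) the current non-command segment.
            if seg_start is None:
                seg_start = words[i].start
            seg_end = words[i].end
            i += 1
    if seg_start is not None:
        segments.append((seg_start, seg_end))
    return commands, segments
```

In this sketch the returned time spans could be used to cut the buffered audio, so that the non-command segments are stored or passed to a command undecoded, or resubmitted for recognition against a different vocabulary.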